System and method for delivery of near-term real-time recorded video

ABSTRACT

A system and method for delivery of near-term real-time recorded video includes a byte stream server to provide streaming video from a server platform for video that is being simultaneously recorded to files. The video can be delivered at programmable rates, including faster than real-time, real-time, and slower than real-time. The delivered video can include all video frames, or only I-frames/Keyframes.

RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application No.62/559,744 filed Sep. 18, 2017, entitled SYSTEM AND METHOD FOR DELIVERYOF NEAR-TERM REAL-TIME RECORDED VIDEO which application is herebyincorporated herein by reference in its entirety.

FIELD OF THE INVENTION

This invention relates generally to security systems, surveillance andmore particularly to networked video systems and streaming video.

BACKGROUND OF THE INVENTION

In security and access control applications, video capabilities are animportant feature at access points in an individual office or a facilityincluding one or more buildings and at residences. The installation ofcameras and display monitors, the addition of new features and theoperation of conventional systems is often complicated by the use ofvarious incompatible communications channels required by the individualsystems. Managing the configuration of cameras and display monitorswhich are often remote from one another is complicated in conventionalsystems. Delivering near-term real-time recorded video from a largenumber of video sources on demand is not easily accomplished.

On video recording servers that support recording of a large number ofsimultaneous camera feeds, it is not possible, due to memorylimitations, to provide a delayed buffering scheme or queuing scheme forevery camera in the server's recording pipeline. This is particularlyrelevant for high definition (HD) cameras, where 30 seconds of videobuffering could comprise approximately 60 MB of random access memory(RAM), assuming a compressed video data rate of 16 Mbps. For a serverthat is supporting hundreds of concurrent camera connections andrecordings, the memory required to buffer all video streams for just 30seconds of immediate access and playback by an external entity (videoplayback client, e.g.) would be prohibitively large (e.g., 6 GB of RAMfor each of 100 HD cameras totaling more than a half a terabyte of RAMmemory). Additionally it is difficult to gain immediate access to videoframes on a server for playback without having to buffer the videoframes in memory.

The current state of the art is to generally use a circular queue tobuffer a small amount of data for each stream—this limits the totalnumber of streams possible to buffer, as well as the amount of frames(or time) possible to buffer for each stream or a select number ofstreams. Another conventional approach is to use very small (on theorder of 2-5 seconds) recorded media files. This has complexitiesinherent in the approach, such as the server having to manage and indexa large number of files as well as the file system churn of opening andclosing a substantially large number of video recording files,particularly when the simultaneous camera count is high. The clientmechanism is complex as well, as it needs to download or progressivelydownload a large number of recorded files, as well as manage and playback the downloaded file-based media.

It would, therefore, be desirable to provide a system which will allowimmediate access to all past video frames in time (including a smalltime delta) of any and all camera feeds being recorded on a videoserver.

SUMMARY OF THE INVENTION

Embodiments described herein provide methods for streaming video from aserver platform while the video is being simultaneously recorded tofiles. The video can be delivered at programmable rates, includingfaster than real-time, real-time, and slower than real-time. Thedelivered video can include video frames, or only I-frames/Keyframes.Frames that are actively being recorded to a video file container can beaccessed for immediate decode and playback by an external client withouthaving to buffer the frames in memory.

The methods described herein will allow almost immediate access to allpast video frames of camera feeds being recorded on a video server withonly a small time delay. Frames that are actively being recorded to avideo file container can be accessed for decode and playback by anexternal client without having to buffer the frames in memory.

Embodiments described herein include system for delivery of near-termreal-time recorded video including a byte stream server having a networkinterface, at least one client worker thread coupled to the byte streamserver, a time-based file search engine coupled to the at least oneclient worker thread, at least one file frame mapper-indexer coupled tothe at least one client worker thread, a video and index file storagesystem for storing video and index files coupled to the at least onefile frame mapper-indexer, an encapsulater coupled to the at least onefile frame mapper-indexer and the video and index file storage systemand a client playback subsystem coupled to the at least one clientworker thread and to the network interface. With such an arrangement,programmable delivery frequency, auto-continuation, Iframe only andconfigurable frame periodicity delivery modes are provided. This isaccomplished without requiring large RAM memory buffers and only uses asingle re-usable frame buffer of memory, and minimal resources (e.g.,small amount of disk I/O, and small amount of CPU).

In one embodiment, a technique to method to deliver near-term real-timerecorded video includes receiving a byte stream request including a timesearch criterion from a playback requester, initiating a client workerthread corresponding to the received byte stream request, searchingvideo and index files to find frames corresponding to the time searchcriterion; mapping and encapsulating the searched and found frames andtransmitting the encapsulated frames to the playback requester. Such amethod facilitates access and play back video almost immediately (i.e.,a very short delay) upon an event occurring at a specific time in thenear past (generally 2-7 seconds in the near past) for any camera sourcebeing recorded. This also allows the playback of near-term video to becontinually accessed toward a time of “now.”

Other embodiments of the invention that are disclosed herein includesoftware programs to perform the steps and operations summarized aboveand disclosed in detail below. One such embodiment comprises a computerprogram product that has a computer-readable medium including computerprogram logic encoded thereon that, when performed in a computerizeddevice having a coupling of a memory and a processor and a display,programs the processor to perform the operations disclosed herein. Sucharrangements are typically provided as software, code and/or other data(e.g., data structures) arranged or encoded on a computer readablemedium such as an optical medium (e.g., CD-ROM), hard disk or other amedium such as firmware or microcode in one or more ROM or RAM or PROMchips or as an Application Specific Integrated Circuit (ASIC). Thesoftware or firmware or other such configurations can be installed ontoa computerized device to cause the computerized device to perform thetechniques explained herein. Other configurations include webapplications, browsers, IP applications and data enabled deviceapplications as will be explained in more detail.

It is to be understood that the features of the systems and methods fordelivery of near-term real-time recorded video can be embodied strictlyas a software program, as software and hardware, or as hardware alonesuch as within a single processor or multiple processors, or within anoperating system or within a software application. One embodimentincludes a computer-readable non-transitory storage medium havinginstructions stored thereon for processing data information, such thatthe instructions, when carried out by a processing device, enable theprocessing device to perform operations of: receiving a byte streamrequest including a time search criterion from a playback requester,initiating a client worker thread corresponding to the received bytestream request, searching video and index files to find framescorresponding to the time search criterion; mapping and encapsulatingthe searched and found frames and transmitting the encapsulated framesto the playback requester.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing and other objects, features and advantages of theinvention will be apparent from the following more particulardescription of embodiments of the invention, as illustrated in theaccompanying drawings and figures in which like reference charactersrefer to the same parts throughout the different views. The drawings arenot necessarily to scale, with emphasis instead being placed uponillustrating the embodiments, principles and concepts of the invention.These and other features of the invention will be understood from thedescription and claims herein, taken together with the drawings ofillustrative embodiments, wherein

FIG. 1 is a schematic block diagram of a byte stream server operating inan integrated security system including networked monitor appliances andnetworked monitor controllers according to one embodiment disclosedherein;

FIG. 2 is a block diagram of the byte stream server and client playbacksystem of FIG. 1; and

FIG. 3 is a flow diagram illustrating the steps to deliver near-termreal-time recorded video from the byte stream server of FIG. 2.

DETAILED DESCRIPTION OF THE INVENTION

In embodiments described herein a byte stream server solves the problemof how to gain immediate access to video frames on a server for playbackwithout having to buffer the video frames in random access memory.

Now referring to FIG. 1, an exemplary system 100 for delivery ofnear-term real-time recorded video includes a byte stream server (BSS)102 having a network interface and a client playback subsystem 104. Thebyte stream server operates in an environment which includes networkedmonitor controllers 120 (also referred to as NMA controller) andnetworked monitor appliances 110 a-110 n (generally referred to as NMA110). The NMA controller 120 includes, but is not limited to a global orlocal security control system 130, a mobile device 140 (with or withouta camera) running a mobile application, a forensic desktop system and adigital video recording system (DVR) 150 which can be directly hardwired to cameras or controlling cameras over a network. The DVR 150 andmobile device 140 can be both an NMA controller 120 and a video source160. The NMA 110 includes, but is not limited to, a personal computer, aset-top box and a mobile device. The NMA 110 can be connected to adisplay 112 and optionally more than one display (e.g., display 112 b).The NMA 110 generally maintains a rectangular bitmap display on thedisplay 112. In one embodiment, the NMA 110 includes a client app foruse on a personal computer (PC) including one or more separate displays112 a-112 b. In another embodiment, the NMA 110 is available as anapplication on a dedicated set-top box with one or more displays. Instill another embodiment, the NMA 110 is available as an app running ona mobile device operating system (e.g., iOS or Android) with anintegrated display. The client playback subsystem 104 to request andreceive BSS transmitted frames includes the deserialization (or framing)of the video frames from the received stream of bytes received from theBSS 102.

In general, the BSS 102 does not take the place of the fully functionalDVR 150, but in certain embodiments the BSS 102 an important enhancementto the DVR 150. If the DVR 150 is configured to record live streams witha requirement to deliver frames at any time during the recording beforethe DVR 150 commits the live frames to file storage, the BSS 102 couldoffer this functionality, as long as the temporary storage mechanism inthe DVR is file based. Live video can be acquired from crowd sourcedvideo, a video recorder, an IP camera, IP-based video source andexternal video. If the frames of video were being stored in a file basedmemory map before being committed via the disk I/O subsystem to afile/container, the BSS could be used to index into that memory as wellto deliver video frames. In that embodiment, the search engine wouldsearch and return an ascending time-ordered list of memory maps for thevideo frames and associated metadata. The BSS 102 can operate inconjunction with other types of video servers in addition to DVRs. Thesevideo servers are also sourced with video from IP-based video cameras,and the BSS 102 can operate in conjunction with these servers.

Now referring to FIG. 2, the byte stream server 102 includes a networkinterface 105 and in operation initiates multiple instances clientworker threads (CWT) 202 a-202 n (generally referred to as CWT 200) inresponse to byte stream requests from users through also client playbacksubsystem 104. The byte stream server 102 and the CWT 202 initiate atime-based search engine 204 a (referred to as time-based search engine204) and a file frame mapper-indexer 206 a (referred to as file framemapper-indexer 206) coupled to a video and index file storage system 210and an encapsulater 208 a (referred to as encapsulater 208) coupled toat least one file frame mapper-indexer 206 and the video and index filestorage system 210. The system 100 further includes a client workerthread terminator 203 and a video recording engine 220 coupled to thevideo and index file storage system 210. In one embodiment, video framesare recorded in a video and index file storage system. In a furtherembodiment, a frame index entry is recorded in the video and index filestorage system.

The delivery of near-term real-time recorded video process used toretrieve frames of video from any time in the past, even the recent past(e.g. 1-2 seconds in the past from the current time) includes indexinginto the stored video files and retrieving frame data precisely from thefile containers using the index data. The frames can be efficiently readout of the file containers and encapsulated in a format for transmissionusing the standard TCP/IP protocol (via standard socket programming).The encapsulation includes a start code, the frame data, any additionaldata that can help with the playback (e.g., frame number, PTS value,etc), and a stop code. At the receive side, the receiver will use theencapsulation parameters to “frame” the data from the received bytestream, and extract the video frames for playback (or other frameprocessing).

An external client can retrieve the frames using http protocol access.The frame indexing and transmission is executed as part of the handlingof this external http request. In one embodiment request directed to aURL address can include the following optional parameters:

autocontinue=true/false (default=false)

iframeonly=true/false (default=false)

transferrate=<rate>(default is 20 msec per frame)

Throttling (or the frequency of frame transmission) is accomplished bydelaying a settable amount between each frame encapsulation andtransmission. The frequency of frame transmission can be set toreal-time (i.e. 1/frame rate), slower than real-time, or faster thanreal-time, depending on the needs of the client playback mechanism. Theprocess cannot transmit frames from the future, so afaster-than-real-time transmission process will eventually starve itselfof frames to transmit. In this scenario, a re-try mechanism is put inplace to continue the transmission process.

The delivery of near-term real-time recorded video process can run intwo modes: a first mode maps and transfers only those frames that areincluded within a precise desired search time range (e.g., map andtransfer frames from ten seconds ago to a time of “now”, where a time of“now” is a snapshot of the current time). A second mode is a bit morecomplex (referred to as an “auto-continuation” mode). Inauto-continuation mode, the process will continually map, encapsulate,and transmit frames toward a time of “now.” In other words, the processof mapping, encapsulating, and transmitting frames is able to stay justbehind the “live” video in time toward the time of “now.” The processcan do this indefinitely until the client or requestor decides toterminate the request and transmission of frames.

The Byte Stream Server 102 handles http requests from external devices,requesting frames in a specific time range from a unique source. In oneembodiment the Byte Stream Server 102 is an http server, and the requestURL is constructed as follows:http://server_ip_addr/vmsapi/bss/uuid=<uuid>:subcam=<subcam>:starttime=<StartTime>:endtime=<EndTime>:iframeonly=<0:1>:rate=<txrateinmsecs>:autocontinue=<0:1>:addCompType=<0:1>.

The front end of the BSS 102 server, which accepts the connectionrequest, can parse an optional parameter called “addCompType.” Thisprocess maintains backward compatibility with legacy clients currentlyusing the BSS for H.264 and is constructed as follows:http://server_ip_addr/vmsapi/bss/uuid=<uuid>: subcam=<subcam>:starttime=<StartTime>:endtime=<EndTime>:iframeonly=<0:1>:rate=<txrateinmsecs>:autocontinue=<0:1>:addCompType=<0:1>.

If “addCompType” is not provided, or if it provided but set to 0, theBSS encapsulation header will not include a compression type. IfaddCompType is provided and set to 1, then Compression Type (H.264=1,H.265/HEVC=2) will be included in the BSS encapsulation header.

In another embodiment, the Byte Stream Server 102 encapsulation headerfurther includes an optional compression type, so the external clientcan determine (on a frame-by-frame basis) what compression type was usedfor each video frame being transmitted (the client sets up a play-backpipeline, including a decoder, based on the compression type). TheCompressionType can be an eight byte value that resides in the BSS 102header after an eight byte SubCam Number, but before an eight byte Widthvalue.

For requests handled by the BSS 102, the BSS 102 will create and executea “client worker thread” (CWT) 202 (collectively referred to as CWT's202. Each CWT 202 is unique to each specific request and handles thesearch, frame mapping/encapsulation, and transmission for that specificrequest only. The client worker thread terminator 203 handles thetermination and clean-up of all the CWTs 202, when the CWTs are finishedwith their specific work.

The time-based file search engine 204 (also referred to as Search Engine204) in response to a request in a time search range or a starting timein a continuous search, locates the files in the range, and provides anascending time ordered list of files. The actual frames within the timerange are located within the files, and usually not from file beginningto end (i.e., the desired frames are located somewhere in the middle ofthe file(s) with respect to time).

There is one instance of the search engine 204 for each CWT 202. Thesearch engine 204 determines the file list that represents the videoframes for the requested search range (Start Time to End Time). However,when the byte stream server 10 is in the auto-continuation mode, onceall the frames are delivered by the CWT 202 for the initial searchrange, the CWT 202 will automatically issue additional searches, usingnew instances of the search engine. So, a CWT 202 can use sequences ofsearch engines 204 to complete its work. However, only one instance ofthe search engine is active at any one time for each CWT 202. Once asearch engine 204 has accomplished its work, the search object instanceis deleted.

When a search is performed by each CWT 202 to locate the video filesincluding the desired time range, the target video frames are almostalways going to be contained somewhere in the middle of one or more ofthese files (i.e. not at the very start of the first file and not alwaysto the very end of the last file). The search engine 204 must locate thevideo files in range, and then the precise video frames within the videofiles for ultimate encapsulation and transmission. The video frameswithin the video files, ultimately representing the start time and endtime range, are located using the file time (file name is actually basedon time—generally second resolution), offset with the presentationtimestamp (generally millisecond resolution) for each frame. The PTS foreach frame is captured during the recording cycle in the metadata whichaccompanies each recorded video file. The key here is that the capturedPTS is zero based for the first frame within each recorded file. Thistrimming process is applied to the beginning of the file sequence (headof the first file in time range sequence) as well as to the end file(tail of the last file in time range sequence).

The time-based search engine 204 accepts a start and end time range inCoordinated Universal Time (UTC), a unique stream UUID, and a subcamnumber to locate all recorded media files and their associated metadatafiles within the time range request. The time-based search engine 204will output a list of file names in ascending time order, representingvideo found within the time based search range. The ascending time orderallows the BSS trimming process (in the file frame mapper-indexer 206)to work efficiently. The UTC file names and time-based directorystructures, along with the metadata for each file, allows the file framemapper-indexer 206 to trim the search results (i.e., find the exactvideo frames within the target video files associated with the start andend time search). The start and end time values are generally also inUTC time. The metadata index data provides the time offset within eachvideo file for every frame recorded into each video file. This timeoffset data allows the file frame mapper-indexer 206 to trim down toexact video frame locations.

The File Frame Mapper-Indexer 206 (also referred to as the mapper 206 orFrame Mapper-Indexer 206) receives the files provided by the searchengine and creates a frame map or frame vector. The frame map is anarray or vector for each frame (array of frames) and is used by theencapsulater 208 component (in each CWT) to transmit frames. The mapper206 creates a vector of frames, precisely mapped (using a trimmingprocess with respect to the search results) to the start and end time.The precise mapping is done with respect to the PTS values provided foreach frame. In one embodiment the file frame mapper-indexer 206identifies and maps the requested range of frames for transmission.During a recording cycle, a frame index file is created. This index fileis updated in real-time, as the frames arrive to be recorded, andincludes the frame type (e.g., I, P, B, frames) offset (byte writeoffset in the file), size in bytes of each video frame, and the timinginformation required to play back the video frame (e.g., PresentationTime Stamp—PTS using H.264 notation). In essence, the recording creationprocess includes the actual video recording file (e.g., an mp4 filecontainer) and the associated index file.

In one embodiment the format being used to encapsulate each video framefor transmission of encapsulated frames includes the following fields:

Variable Length of Private Boundary Delimiter 8 Byte Start Code 36Character String of UUID 8 Byte Subcam Number 8 Byte Width 8 Byte Height8 Byte Frame Rate 8 Byte Frame Type (1 = IFrame, 2 = PFrame, 3 = BFrame)8 Byte Frame Number in Sequence 8 Byte Total Number of Frames to beTransmitted 8 Byte First PTS Value in Sequence 8 Byte Last PTS Value inSequence 8 Byte PTS Value for this Frame 8 Byte ExtraDataSize (sps/pps)ExtraData of Size ExtraDataSize 8 Byte VideoFrameSize Video Frame ofsize VideoFrameSize 8 Byte Stop Code

The ExtraData in the BSS 102 header can include VPS—Video parameter set;SPS—Sequence parameter set; PPS—Picture parameter set for H.265/HEVC.H.264 only receives SPS/PPS. This is part of the extradatablockstandards for the two individual codecs.

For each frame written (or, more accurately, muxed) to the recordingfile, an index entry is added to the index file associated with therecording file. The format of the index entry is:

FrameType_Byteoffset_FrameSize_PlaybackTimeStamp.

The frame mapper/indexer will use this information to create an array orvector of the frames based on the time-based search range. These framesare encapsulated and transmitted via TCP/IP to the client.

In one embodiment, the Frame Mapper-Indexer 206 receives the start andend time of the search, as well as the complete list of files (i.e.search results) from the Search Subsystem, which includes theaccompanying metadata index files for each recording file. The FrameMapper-Indexer 206 uses the metadata indexing data, which specifiesframe type, write byte offset into the file, frame size, andpresentation time stamp for each frame, to trim the frame list withinrange to the video frames exactly representing the range within startand end times. Again, the search subsystem does not locate exact frames.It only locates files representing frames within a search range. It isthe responsibility of the Frame Mapper-Indexer 206 to take the file listand produce an array or vector of the precise time-based video framesthat represent the range of start and end time. All of these precisetime-based video frames are ultimately transmitted to satisfy thetime-based request from an external client to the BSS 102. The FrameMapper-Indexer 206 produces a vector of frames that will be encapsulated(using the BSS encapsulation header information described above) by theencapsulator 208, and transmitted by the BSS client worker thread 202.

In one embodiment, the video recorder engine 220 records received videoto files and creates a frame index mapping file that is used by the ByteStream Server 102 and associated components to deliver the appropriateframes upon request. In this embodiment, a database is not used to storeand retrieve index data for the file frame mapper-indexer 206. Insteadindex data is stored in metadata files which accompany each recording.This embodiment solution leverages the efficiencies in the systems filesystem. File index data is written out (flushed) to file at a ratecorresponding with the original video source's frame rate. In oneembodiment, the Live Video Feed (e.g., RTSP/RTP based H.264) iscompressed from an IP-based source.

The BSS 102 can map all the way back to the earliest time for any camerasource that has been recorded. In one embodiment the smallest time backis one second. The indexing metadata writes (flushes to disk) are drivenoff each camera source's frame rate, and this index data is beingflushed to the meta-index file in one second chunks. For example, if acamera source is 30 frames per second, the index file will get 30entries flushed to it in one second. If the camera is configured toproduce one frame per second, the index file will get one entry flushedto it in that same one second time. Entries could be flushed morefrequently than one second, and put more demands on the file-disk I/Osubsystem, but it is not generally necessary. In certain embodiments,one second is somewhat aggressive, but the timing is adjustable in thevideo recording engine 220. The BSS 102 provides the ability for asecurity officer, for example, to click on an event (e.g., associatedwith a door being accessed) and immediately play the video associatedwith the event. It's unlikely an individual can see an event and clickon it via a user interface (UI) within one second. However, one reasonfor the selection of one second is that it is envisioned that a systembeing configured to automatically react to and selecting video forplayback associated with an event faster than a human can do it though aUI. Even in this case, one second past access seems generally adequatefor most security applications.

The video recording engine 220 is responsible for connecting to andmanaging connections to IP-based surveillance cameras (e.g. viaRTSP/RTP/UDP or TCP). This includes receiving and ultimately recording(appropriately muxing and writing video frames to a file container—usingmp4 in one embodiment). The way this subsystem feeds into the BSS 102 isin the metadata generated for each recording file, as well as inrecording file names and directory structures used to actually write outthe recorded video. Again, a database is not used to hold any of thisinformation. The file names and directory structures are generated usingUTC time, and the files reside on the disk in the following format:yyyy/mm/dd/hh/, where yyyy=year, mm=zero-based month, dd=day, andhh=hour. The video frames themselves are muxed into the mp4 container,and the container name is generated as mmss_UUID_subcam.mp4, wheremm=minute, ss=second, UUID=unique to each camera, subcam=specifies thesub camera number for the camera, as many IP-based cameras offer morethan one camera view, typically through separate lenses. Complete withthe recorded file, residing in its appropriated time-based recordingdirectory structure and using the appropriate time-based file name, isan accompanying metadata file. For example, if live video is recordingin 2017/08/15/14/4045_UUID_0.mp4, then the corresponding metadata isbeing generated in 2017/08/15/14/4045_UUID_0.mp4.idx1. Note thatgenerally a file extension of idx1 is used. The same file name base isused for the metadata files to allow the time-based search engine 204 toeasily associate recorded file containers with their metadata.

The BSS 102 differentiates between the frame types using the NetworkAbstraction Layer (NAL) types in the compressed video bit stream duringthe recording cycle. When the video is being recorded and the index mapis created, this includes for each frame the byte offset in the recordedfile container, frame size, frame type, and presentation timestamp. Therecorded video can include all video frames, or only I-frames/Keyframes.The frame types for H.264, H.265, or MPEG-4 are I (Intra-frame), P(Predicted), and B (Bidirectional). The HEVC/H.265 standard is alsosupported. When the file frame mapper-indexer 206 and frame encapsulater208 are requested to deliver only I-frames, it can discard all thoseframes within the time search range that are not the I-frame type (i.e.,discard the P and B frames). The client decoder (not shown) can decodeI-frames by themselves, without referencing any other frames, as long asthe extra-data-block (SPS/PPS for H.264, SPS/PPS/VPS for H.265) isprovided (which is encapsulated and sent by the BSS as part of theencapsulation header). It is understood that with a compression typesuch as MJPEG, all frames are I-frames. For this recorded compressiontype, I-frame only mode request would be accompanied by a frequencyparameter for frame inclusion (i.e. all frames, every other frame, everythird frame, etc). It is possible to send only P frames or B-frames forH.264, H.265, and MPEG-4, but this would not be useful to end clientdecoder without the accompanying I-frame (and Is and Ps in the case of Bframes) for decoding.

It should also be noted that, for compressed video types that containmultiple frame types, not only can the architecture deliver I-frameonly, but it can deliver I-frames at a frequency corresponding to aninteger multiple of the original source's I-frame interval or Group ofPictures (GOP) size. For example, if the original recorded source is 30frames per second with a GOP size of 30 (i.e. I-frame every 1 second),the frames delivered in the I-frame only delivery mode can represent oneframe per second, or one frame every two seconds, etc, by discardingI-frames in order to deliver the desired I-frame frequency. In otherwords, we can discard I-frames at a set rate in the I-frame onlydelivery mode. Also note that importance of the statement that thisI-frame frequency must be an integer multiple of the I-frame frequencyof the original recorded source. The BSS 102 neither provides videotranscoding nor trans-rating pipeline in the BSS, which would berequired to deliver an I-frame mode that is not an integer multiple ofthe original GOP size. Other than in I-frame only transmission mode, thesolution is agnostic with regards to frame types. The frame type is usedis during the encapsulation of each frame in the transmission sequence.Frame type is also useful within the playback client to reconstruct thestream for decoding, if desired.

No special hardware or operating system (OS) is required to deliver ofnear-term real-time recorded video. However, faster CPUs and faster diskI/O subsystems will perform better and allow more BSS stream accessconcurrency. System Activity Monitoring/Reporting (SAR) testing on theBSS 102 indicates that the CPU and disk I/O load on the BSS 102 is quitelow for both the metadata generation required during the recordingcycle, as well as the disk access during the BSS stream access. Also,unlike a traditional media streaming server, the simultaneous accesscount to the BSS 102 is not expected to be very high.

The BSS server is not a conventional video server (e.g., Video On Demand(VOD) server and other streaming live media servers). It is a specificarchitecture, design, and implementation put together to satisfy aninstant replay (or instant playback) feature within a video recordingserver, without having to buffer video streams, which is prohibitivelyexpensive (with regards to memory). The request and transmission by theBSS could represent a “live stream” of the recorded source, delayed by asmall amount of time (typically between 1-2 seconds). However, the BSS102 is optimized to function as an instant replay server, in contrast toa “live streaming” server.

Turning now to FIG. 3 in which like reference numbers refer to likeelements of FIGS. 1 and 2 a flow diagram illustrates a process fordelivery of near term, real-time recorded video. In the flow diagram ofFIG. 3 the rectangular elements are herein denoted “processing blocks”(typified by step 310 in FIG. 3) and represent computer softwareinstructions or groups of instructions. The diamond shaped elements inthe flow diagrams are herein denoted “decision blocks” and representcomputer software instructions or groups of instructions which affectthe operation of the processing blocks, Alternatively, the processingblocks represent steps performed by functionally equivalent circuitssuch as a digital signal processor circuit or an application specificintegrated circuit (ASIC). It will be appreciated by those of ordinaryskill in the art that some of the steps described in the flow diagramsmay be implemented via computer software while others may be implementedin a different manner (e.g. via an empirical procedure). The flowdiagrams do not depict the syntax of any particular programminglanguage. Rather, the flow diagrams illustrate the functionalinformation used to generate computer software to perform the requiredprocessing. It should be noted that many routine program elements, suchas initialization of loops and variables and the use of temporaryvariables, are not shown. It will be appreciated by those of ordinaryskill in the art that unless otherwise indicated herein, the particularsequence of steps described is illustrative only and can be variedwithout departing from the spirit of the invention.

As described below, steps 310-360 generally occur on the BSS 102. Theprocess commences in step 310 when a byte stream request including atime search criterion from a playback requester is received. In step320, a client worker thread 202 corresponding to the received bytestream request in initiated by the BSS 102. In step 330 video and indexfile storage system 210 are searched to find frames corresponding to thetime search criterion. Searching a set of video and accompanying indexfile includes searching index file metadata. In step 340, the searchedand found frames are mapped and indexed. In step 350, the mapped andindexed frames are encapsulated and finally the in step 360 theencapsulated frames are the transmitted to the playback requester. Whenoperating in a continuous search mode, the CWT 202 continuously providesencapsulated frames to the client requestor.

All publications and references cited herein are expressly incorporatedherein by reference in their entirety. Having described the preferredembodiments of the invention, it will now become apparent to one ofordinary skill in the art that other embodiments incorporating theirconcepts may be used. It is felt therefore that these embodiments shouldnot be limited to disclosed embodiments but rather should be limitedonly by the spirit and scope of the appended claims.

What is claimed is:
 1. A system for delivery of near-term real-timerecorded video comprising: a byte stream server having a networkinterface; plurality of client worker threads coupled to the byte streamserver; a time-based file search engine coupled to the plurality ofclient worker threads; at least one file frame mapper-indexer coupled tothe plurality of client worker threads; a disk based video and indexfile storage system for directly storing video and index files to diskand coupled to the at least one file frame mapper-indexer, wherein theindex files include timing information for video frame playback fromrecorded video; an encapsulater coupled to the at least one file framemapper-indexer and the disk based video and index file storage system,wherein the encapsulater includes a present time stamp values, in atleast one encapsulated frame and combines at least one non-buffered,non-cached video frames directly from the disk based video and indexfile storage system and index data from the index files into the atleast one encapsulated frame; and a client playback subsystem coupled tothe plurality of client worker threads and to the network interface,wherein the system provides near-term real time playback of recordedvideo including recorded video from a plurality of camera feeds activelybeing recorded; wherein the plurality of client worker threads arelocally coupled to the time based search engine, the encapsulater, andthe byte stream server; and a user interface, (UI) coupled to theplurality of client worker threads, wherein the system provides aninstant replay of any one of the plurality of camera feeds.
 2. Thesystem of claim 1, further comprising a video recording engine coupledto the disk based video and index file storage system.
 3. The system ofclaim 2, wherein the video recording engine is adapted to receive atleast one video stream.
 4. The system of claim 3, wherein the at leastone video stream comprises at least one live video feed compressed froman IP-based video source.
 5. The system of claim 1, wherein the at leastone live video feed originates from crowd sourced video.
 6. The systemof claim 1, wherein a near-term delay between live video andtransmission of encapsulated recorded video frames is less than twoseconds while the camera feeds are actively being recorded.
 7. Thesystem of claim 1, wherein frame mapper-indexer and frame encapsulaterdeliver only I-frames.
 8. The system of claim 7, wherein framemapper-indexer and frame encapsulater deliver I-frames at a frequencycorresponding to an integer multiple of the original source's I-frameinterval.
 9. A computer-implemented method to deliver near-termreal-time recorded video comprising: receiving via a byte stream servera byte stream request including a time search criterion from a playbackrequester; initiating a client worker thread corresponding to thereceived byte stream request, wherein the client worker thread islocally coupled to the byte stream server; recording non-buffered,non-cached video frames directly to a disk based video and index filestorage system and generating at least one index file; searchingnon-buffered, non-cached recorded video and the at least one index fileto find frames corresponding to the requested time search criterion,wherein finding frames uses a time based search engine locally coupledto the client worker thread; mapping the recorded, searched and foundnon-buffered video frames, wherein mapping includes creation of theframe mapping index vector which provides the indexing into the activelyrecorded non-buffered, non-cached video frames; encapsulating themapped, recorded, searched and found non-buffered, video frames, whereinan encapsulated frame includes a presentation timestamp and a frametype, wherein encapsulation uses an encapsulater locally coupled to thedisk based video and index file storage system; transmitting theencapsulated recorded searched and found non-buffered video frames tothe playback requester; and instantly replaying video from theencapsulated recorded searched and found non-buffered video frames tothe playback requester.
 10. The method of claim 9, wherein the timesearch criterion comprises a time range.
 11. The method of claim 9,wherein the time search criterion comprises specifying anauto-continuation mode to continually map and transmit frames from astart time.
 12. The method of claim 11, further comprising playing backvideo from the start time at a faster than real time rate to stay justbehind live video.
 13. The method of claim 9 further comprisingrecording a video frame in the disk based video and index file storagesystem.
 14. The method of claim 9 further comprising recording a frameindex entry in the disk based video and index file storage system. 15.The method of claim 9, wherein searching video and index file storagesystem to find frames comprises generating an array of framescorresponding to a search criterion.
 16. The method of claim 9 furthercomprising throttling the transmission of the encapsulated, searched andfound frames to the playback requester.
 17. The method of claim 16,wherein throttling the transmission of encapsulated frames to theplayback requester comprises one of: transmitting the encapsulated,recorded, searched and found frames in real-time; transmitting theencapsulated recorded, frames faster than real-time; and wherein thetransmission of encapsulated recorded, searched and found frames occursin less than two seconds of the real time recording of the frames whileactively recording the non-buffered video frames and generating the atleast one index file.
 18. The method of claim 9, further comprisingplaying back the encapsulated frames using the presentation time stampvalues in the encapsulated frames.
 19. The method of claim 9, furthercomprising delivering only I-frames at a frequency corresponding to aninteger multiple of the original source's I-frame interval.
 20. Acomputer-readable non-transitory storage medium having instructionsstored thereon for processing data information, such that theinstructions, when carried out by a processing device, enable theprocessing device to perform operations of: receiving a byte streamrequest including a time search criterion from a playback requester;initiating a client worker thread corresponding to the received bytestream request; recording non-buffered video frames and generating atleast one index file; searching non-buffered recorded video frames andat least on index file to find frames corresponding to the requestedtime search criterion, wherein finding frames uses a time based filesearch engine locally coupled to the client worker thread; mapping andencapsulating the searched and found non-buffered video frames, whereinmapping includes creation of the frame mapping index vector whichprovides the indexing into the actively recorded non-buffered videoframes, wherein encapsulation uses an encapsulater locally coupled tothe disk based video and index file storage system to encapsulate therecorded non-buffered video frames and mapping uses a file frame mapperindexer locally coupled to the encapsulater to generate at least oneframe map; and transmitting the encapsulated, searched and foundnon-buffered video frames to the playback requester; and instantlyreplaying video from the encapsulated recorded searched and foundnon-buffered, non-cached video frames to the playback requester.